Keyword-based Discriminative Training of Acoustic Models1
نویسنده
چکیده
In this paper, we investigate a new discriminative training technique which focuses on optimizing a keyword error rate, rather than the error rate on all words. We hypothesize that improvements in keyword error rate correlate with improvements in understanding error rates. Keyword-based discriminative training is accomplished by modifying a standard minimum classification error (MCE) training algorithm so that only segments of speech relevant to keyword errors are used in the acoustic model training. When both the standard and keyword-based techniques are used to adjust mixture weights, we find that keyword error rate reduction compared to baseline maximum likelihood (ML) trained models is nearly twice as large for the keyword-based approach. The overall word accuracy is also found to be improved for keyword-based training, and we run several experiments to investigate this phenomenon.
منابع مشابه
A keyword-boosted sMBR criterion to enhance keyword search performance in deep neural network based acoustic modeling
We propose a keyword-boosted state-level minimum Bayes risk (sMBR) criterion for training DNN-HMM hybrid keyword search systems by enhancing acoustic detail of a given list of target keyword terms. The rationale behind the proposed discriminative training strategy is to place more acoustic modeling emphasis on states appearing in the given keywords. We observed a relative gain of 1.7 ~ 6.1% in ...
متن کاملA Novel Discriminative Score Calibration Method for Keyword Search
The performance of keyword search systems depends heavily on the quality of confidence scores. In this work, a novel discriminative score calibration method has been proposed. By training an MLP classifier employing the word posterior probability and several novel normalized scores, we can obtain a relative improvement of 4.67% for the actual term-weighted value (ATWV) metric on the OpenKWS15 d...
متن کاملSpeaker Recognition using keyword Hidden Markov Models and Support vector machines
New approaches to speaker and background model training have given rise to many recent developments in speaker recognition. Recently, various text-dependent approaches have surfaced, including a keyword Hidden Markov Models (HMM) approach [1]. This approach also deviates from the traditional bag-offrames approach by taking into account relationships in time among acoustic features for different...
متن کاملDiscriminative Utterance Verification For Connected Digits Recognition - Speech and Audio Processing, IEEE Transactions on
Utterance verification represents an important technology in the design of user-friendly speech recognition systems. It involves the recognition of keyword strings and the rejection of nonkeyword strings. This paper describes a hidden Markov model-based (HMM-based) utterance verification system using the framework of statistical hypothesis testing. The two major issues on how to design keyword ...
متن کاملTitle Placeholder
We propose a keyword-boosted state-level minimum Bayes risk (sMBR) criterion for training DNN-HMM hybrid keyword search systems by enhancing acoustic detail of a given list of target keyword terms. The rationale behind the proposed discriminative training strategy is to place more acoustic modeling emphasis on states appearing in the given keywords. We observed a relative gain of 1.7 ~ 6.1% in ...
متن کامل